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ABSTRACT 


Davenport-Schinzel  sequences  ^re  ^^"^'^'^  '^.'^"^l 

w^   chnw   that   the   maximal   lengtn  oi   d.  vjay^  ^     ,  ^  ■    .u^ 
^u^rcorposed  of  n  symbols  is  0(n«(n)).  -"ere  aW  -  ^e 

slowly  increasing  to  infinity.  This  is  achieve  J 
equivalence    between    such    sequences    ™.^    ^ener^ized    P 
impression  schemes  on  rooted  trees,  and  then  by  analyzing 

these  schemes. 

conditions: 

(i)  1  <  U/  ^  n  for  each  i . 

(ii)  For  each  i  <  m  we  have  u.*  "';  '^  .  _,  „,h  ,hat 

(iii)  There  do  not  exist  .+2  indices  1  ^  -i  <J.  <  _         -^^  ^  ^ 

",,=  •'.,=  ''15=    ••■   =«.■":=""-"'•- 

We  will  write  \V\  =  m  for  the  Ung,h  of  the  sequence  U. 

Define 

X^(^)  =  max  (M  :  V  \s2.DSin,s)  sequence}  . 

The  estimation  of  X.(.)  is  the  subject  of  this  paper. 

^^;;r;7^— -.  .,  .e  .con.  ..o.  H.  .ee.  .ppo..  in  pa.  ..  a  .an.  .on.  ..  U.S.-U... 
Binational  Science  Foundation. 
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This  problem  has  originally  been  posed  by  Davenport  and  Schinzel  [DS]. 
Their  interest  in  it  arose  from  its  connection  to  the  analysis  of  solutions  of 
linear  differential  equations.  Recently,  Atallah  [At]  has  raised  it  again 
independently,  because  of  its  significance  for  problems  in  dynamic 
computational  geometry.  These  two  applications  are  quite  similar,  and  can  be 
briefly  described  a^»  follows.  Let  /i,  .  .  .  ,/„  be  n  real-valued  continuous 
fimctious  defined  on  a  common  interval  /.  Suppose  that  for  each  i=/=y  the 
f.iRCtioas  /,  'Mid  fj  intersect  in  at  most  s  points  (e.g.,  this  is  the  case  for 
polyDomials  of  fi>:c)d  degree,  or  Chebycheff  systems,  and  so  on).  Let 
g{x)  =  min  {/.(m)  :  :  — 1;  .  .  .  ,n},  for  x  €  I,  he  the  lower  envelope  (i.e.  the 
pointwise  minimuTi)  of  the  /^'s,   and  let  m   be  the  smallest  number  of 

subintervals  /- f„  of  /  such  that  for  each  k  there  exists  an  index  i^  with 

g(x)  =  fijx)  for  all  X  f:  4-  In  other  words,  m  is  the  number  of  connected 
portions  of  the  graphs  of  the//'s  which  constitute  the  graph  of  g.  Assuming 
that  /,,     .     ,/,.  -ire.  arranged  in  this  order  from  left  to  right,  put 

^Vl>   •  •  •  »/n)  —   ('l.   •••.'!«)• 

It  is  now  easily  seen  that  U(Ji,  .  .  .  ,/„)  is  a  DS(n,s)  sequence.  Moreover,  it 
is  icnown  (see  [At])  that  for  any  DS(n,s)  sequence  U  one  can  construct  a 
collection  /i,  .  .  .  ,/„  of  such  functions  for  which  U(fi,  .  .  .  ,f„)  =  U. 
Therefore  the  largest  possible  value  of  m  is  precisely  X,(n). 

Thus,  in  this  setting,  Davenport-Schinzel  sequences  are  strongly  related 
to  the  problem  of  computing  the  (lower)  envelope  of  a  set  of  functions  which 
intersect  eacii  other  in  pairs  in  at  most  some  fixed  number  of  points.  This 
problem  has  many  applications  in  computational  geometry  and  related  areas, 
many  of  which  are  given  in  [At];  some  additional  applications  will  be  noted  in 
Section  7. 

The  problem  of  estimating  Xj{n)  has  been  studied  in  several  papers  [DS], 
[Da],  [RS],  [Sz],  [At].  It  is  known  (and  easy  to  prove)  that  \i{n)  =  n  and 
X2(n)  =  2n-l.  For  j  >  3  the  deep  result  of  Szemeredi  [Sz]  states  that 
X,(n)  =  0{n  \og*n),  where*  the  constant  of  proportionality  depends  on  s, 
and  where  log*n  is  the  smallest  /  >  1  such  that  Cy  >  /i,  where  c^  =  2  and 
€,+1  =  2*'.  (We  use  here  the  following  standard  notations.  Let  /,  ^  be  two 
positive  functions  defined  over  the  integers.  Then  f{n)  =  0{g{n))  (resp. 
/(n)  =  Ci{g{n)))  if  there  exists  a  constant  C  >  0  such  that  f{n)  <  Cg{n) 
(resp.  f{n)  >  Cg{n))  for  all  n.  We  write  f{n)  =  (d{g{n))  if  both 
/(n)  =  0{g{n))  and  f{n)  =  n{g{n))  hold,  namely,  if  there  exist  constants 
Ci,  C2  >  0  such  that  C^g{n)  <  f{n)  <  C2^(")  for  all  n.) 

In  this  paper  all  logarithms  are  with  base  2. 
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However,  it  was  not  known  whether  Xj(/i),  for  s  ^  3,  is  actually 
nonlinear  in  n,  i.e.  whether  Davenport-Schinzel  sequences  of  nonlinear  size 
actually  exist. 

In  this  paper  we  show  that  Xj(«)  is  indeed  nonlinear  for  s  >  3,  and  also 
improve  the  upper  bound  of  Szemeredi  for  j  =  3.  Specifically,  we  show  that 
X3(n)  =  @(na(n)),  where  a(n)  is  the  functional  inverse  of  Ackermann's 
function;  the  function  a(n)  is  very  slowly  growing,  and  tends  to  infinity  with 
n.  The  proof  is  based  on  an  interesting  equivalence  between  Davenport- 
Schinzel  sequences  with  s  —  3  and  sequences  of  certain  operations  performed 
on  arbitrary  rooted  trees,  called  generalized  path  compressions. 

The  paper  is  organized  as  follows.  Section  2  introduces  the  notion  of 
generalized  path  compressions  and  compression  schemes  on  trees.  Section  3 
reviews  the  definitions  of  Ackermann's  function  and  its  functional  inverse, 
and  states  the  main  theorems  of  this  paper.  Section  4  establishes  a  linear 
equivalence  between  DS(n,3)  sequences  and  path  compression  schemes,  so 
that  the  problem  can  be  reduced  to  that  of  estimating  the  maximal  length  of 
such  schemes.  Sections  5  and  6  contain  the  derivations  of  the  upper  and  lower 
bounds,  respectively.  Concluding  remarks,  including  some  applications  of 
these  results,  some  open  problems  and  a  discussion  on  Ackermann's 
functions,  are  given  in  the  final  Section  7. 

2.  Generalized  Path  Compressions  on  Trees. 

Let  r  be  an  arbitrary  rooted  tree.  It  is  given  by  a  triple  {V,  r,  ^), 
whereV  is  a  finite  set  of  vertices  (or,  nodes),  r  6  V  is  the  root  of  T,  and 
<J)  :  V-{r}  -  V  is  the  fatherhood  mapping:  for  each  x  i  V,  x  ¥=  r,  4>{x)  is  the 
father  of  x,  and  j:  is  a  son  of  4>{x).  The  mapping  4)  has  no  cycles,  so  that  by 
repeated  applications  of  cf),  every  x  i  V  is  eventually  mapped  into  r.  A  vertex 
is  called  a  leaf  if  it  has  no  sons,  and  inner  otherwise.  We  will  use  the 
following  standard  notations  for  x,y  €  V:  j:  is  a  descendant  of  y  (and  >»  is  an 
ancestor  of  x)  if  there  are  n  >  1  vertices  x^,  xi,  .  .  .  ,x„  €  V,  such  that 
Xi  =  X,  x„  =  y,  and  ^{x,)  =  x,  +  i  for  /  ^  1,  .  .  .  ,m-1;  a:  is  a  proper 
descendant  of  j  if  n  >  1  (thus  x  i^  y),  and  improper  otherwise  (i.e.  x  =  y). 

We  define  an  operation  on  T,  called  generalized  path  compression  (GPC 
for  short)  as  follows.  Let  xi,X2,  ...  .x^  be  a  sequence  of  nodes  of  T  such  that 
each  X,  is  a  proper  descendant  of  x,  +  i,  for  /=!,...  ,k-l.  The  generalized 
path  compression/  =  (a:i,j:2.  •  •  •  -^k)  is  an  operation  that  modifies  7  so  as  to 
make  each  x,,  for  i  =  I.  .  .  .  ,k  —  l,  a.  son  of  x^.  More  precisely,  /  results  in 
making  (^(xj)  =  X/,  for  /  =  1,  .  .  .  .k-l,  and  leaving  (^(x)  unchanged  for  all 
others:;  see  Fig.  2.1. 


This  notion  generalizes  the  notion  of  standard  path  compression,  in  that 
the  nodes  x-^,  .  .  .  ,x,^  are  not  required  to  be  adjacent  along  their  present  path. 
These  standard  compressions  are  used  in  the  efficient  implementation  of  the 
FIND  operation  in  the  set-union  algorithm  used  for  processing  equivalence 
relationships  (see  [la]  for  an  extensive  analysis  and  earlier  references). 

For  each  GPC/  =  {x-^,  .  .  .  .x^)  call  x^  the  starting  node  of/,  and  denote 
it  by  ctO);  the  length  of  /is  [/"j  =  it-1;  this  is  the  number  of  edges 
{xx^i)'  •  ■  ■  .(j^i-irJ^ik)  Jn/  (Jn  general,  these  are  not  edges  of  T). 

Another  notion  we  need  is  that  of  a  postorder  on  T.  It  is  a  linear  order  of 
the  nodes  of  T  obtained  recursively  as  follows:  Suppose  the  root  r  of  T  has  / 
sons  qi,  .  .  .  ,qi.  llien  a  postorder  on  T  is  obtained  by  concatenating 
postorders  of  the  subtrees  of  T  rooted  at  q^,  .  .  .  ,qi  and  appending  r  at  the 
end.  (Thus  T  can  have  many  postorders,  depending  on  the  order  of  the 
enumeration  or  the  sons  of  each  of  its  nodes.)  An  example  of  a  postordered 
tree  is  given  in  Fig.  2.2. 

Finally,  a  sequence  F  =  (fi,  .  .  .  ,f„)  of  GPCs  on  a  rooted  tree  T  is 
admissible  if 

(i)  it  is  executable,  meaning  that  each  /  is  a  GPC  on  the  tree  7/  obtained 
from  T  after  the  compressions /i,  .  .  .  ,/,_i  have  been  executed  (Ti  =  T)\ 

(ii)  it  is  postordered,  meaning  that  there  exists  a  postorder  on  T  such  that  the 
starting  nodes  (t(/'i),  .  .  .  ,o-(/)„)  are  arranged  in  (weak)  postorder  (thus  more 
than  one  GPC  may  stall  at  the  same  node;  however,  if  <y{fi)  "^  cr(fi+i)  then 
ct(/)+i)  succeeds  (j(fi)  in  the  postorder). 

m 

The  length  \F\  of  such  a  sequence  is  defined  as  ^  \fi\.  We  are  concerned 

here  with  the  maximal  possible  value  of  \F\,  or,  more  precisely,  with  the 
quantity 

x{n,m)  =  max  {|F|  :  F  is  an  admissible  sequence  of  m 

GPCs  on  a  tree  T  with  n  vertices  }  . 

(Note  that  x(".^)  is  well  defined,  e.g.  x(".'")  —  ("~l)'"i  since  \f\  ^  n  —  1 
for  any  GPC /on  r.) 

For  our  estimation  of  x  it  is  useful  to  make  the  following  simplifications: 
Let  a  tree  T  with  n  nodes  and  an  admissible  sequence  F  of  m  GPCs  on  T  be 
given.  For  each  /  =  (xj,  .  .  .  .j:^)  €  F,  we  add  to  T  a  new  leaf  xq,  make  it  a 
son  of  x^,  and  change  /  to  {xq,Xi,  .  .  .  .j:^).  Furthermore,  we  extend  the 
postorder  on  T  so  that  all  new  leaves  of  some  node  x  succeed  all  its  original 
sons,  and  arc  arranged  according  to  the  order  of  the  GPCs  they  correspond 
to.  Since  we  want  to  maximize  \F\,  we  can  assume  without  loss  of  generality 
that  every  node  belongs  to  at  least  one  GPC;  otherwise,  remove  from  the  tree 
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such  a  node  (that  does  not  appear  at  all  in  F),  by  connecting  its  sons  (if  any) 
directly  to  its  father;  add  another  node  to  the  tree  (so  that  it  still  has  exactly 
n  nodes)  as  a  son  of  the  starting  point  of  the  first  GPC  in  F,  and  make  it  a 
leaf  (the  first  in  the  postorder);  finally,  extend  the  first  GPC  so  as  to  start 
from  the  new  node;  all  these  changes  yield  another  tree  v/ith  n  nodes  and  an 
admissible  sequence  F'  whose  length  is  greater  by  1  than  that  of  F.  In 
particular,  each  original  leaf  will  now  have  at  least  one  son  (a  new  leaf). 


All  these  modifications  make  T  into  a  tree  with  n  +  m  vertices:  n  inner 
vertices  and  m  leaves.  Moreover,  in  the  modified  sequence  F  each  of  the  m 
GPC's  starts  at  a  different  leaf.  We  will  refer  to  such  a  T  as  an  {n,m)  —  tree, 
and  to  the  corresponding  F  as  a  compression  scheme  on  T.  Thus,  a 
compression  scheme  on  an  {n,m)  —  tree  consists  of  m  GPC's,  exactly  one  GPC 
starting  at  every  one  of  the  m  leaves;  moreover,  the  leaves  attached  to  any 
node  succeed  all  its  inner  sons  in  postorder.  In  what  follows  we  will  aim  at 
the  estimation  of  the  associated  quantity 

^{n,m)  =  max  {|F|  :  F  is  a  compression  scheme  on  an  (n,m)-tree  }  . 

Note  that  in  the  transformation  above  the  length  of  each  GPC  is  increa3ed  by 
1,  thus  [F|  increases  by  m.   Therefore  ^{n,m)  =  x(''.'")  +  "^  for  all  n,  m. 

It  is  known  (see  [Fi],  [Ta])  that  if  no  restrictions  on  the  order  of  the 
starting  nodes  of  the  GPC's  or  on  the  structure  of  T  are  imposed,  then  (even 
for  standard  path  compressions)  vj;(n,n)  =  ©(n  log  n)  (for  easier  comparison, 
we  state  all  results  in  the  case  m  =  n\  usually  m  =  0(n),  which  of  course 
yields  the  same  bounds) .  If  the  tree  T  is  required  to  be  balanced  and  one  uses 
only  standard  path  compressions,  then  v);(n,n)  =  0(na(n))  (cf.  [Ta]).  Here 
we  show  that,  using  generalized  path  compressions,  and  imposing  no 
restrictions  on  the  structure  of  T,  but  requiring  that  the  GPC's  be 
postordered,  ^{n,n)  is  also  0(/ia(n)).  Although  the  same  bounds  are 
obtained  both  here  and  in  [Ta],  there  is  no  obvious  relation  between  the  two 
problems  (in  particular,  our  results  are  not  proved  by  reducing  the  problem 
to  that  studied  in  [Ta],  although  we  adapt  some  of  the  techniques  in  [Ta]  to 
the  derivation  of  our  lower  bounds;  see  also  Section  6.2). 

3.   Statement  of  Main  Results 

In  this  section  we  state  our  main  results,  concerning  the  functions  X3  and 
^.  For  this  purpose,  we  recall  first  the  definition  of  Ackermann's  function 
("generalized  exponentials"  -  cf.  [Ac]). 


Let  N  be  the  set  of  positive  integers  1,2,...  .  Given  a  function  g  from  a 
set  into  itself,  denote  by  g^'^  the  composition  g  og  o  •••  o  g  of  g  with  itself 
s  times,  for  j  €  N.  Define  inductively  a  sequence  {At}f=i  of  functions  from 
N  into  itself  as  follows: 

Ai(n)  =  2n 

AM  =  Ai"Ji(l)  ,       k^2 
for  all  «  ^  N.  Note  that  for  all  it  >  2,  the  function  A^  satisfies 

At(l)  =  2  , 

A*(«)  =  A,_i(A,(n-l)),       n>2. 
In  particular,  A2(n)  =  2A2(n- 1),  thus  Aj  is  the  "power  function" 

Ajin)  =  2"  ,       n  €N  . 
Then  A 3 (n)  ^  2'^^^"~'^\  thus  A3  is  the  "tower  function" 

A3(n)-22""'  , 

with  n  2's  in  the  exponential  tower,  for  n  €  N.   Finally,  put 

Ain)  =  A„in)  . 

This  is  Ackermann's  function  (actually,  there  are  several  variants  of  this 
function;  their  orders  of  magnitude  are  essentially  the  same,  and  our  results 
do  not  depend  on  which  one  we  use;  see  also  Lemma  3.1  below). 
Ackermann's  function  A  grows  very  fast;  its  first  values  are:  A(l)  =  2, 
A(2)  =  4,  A(3)  =  16  and  A (4)  is  a  tower  of  65536  2's.  For  basic 
properties  of  the  functions  defined  above,  the  reader  is  referred  to  [Ta, 
p.219]  (note  that  his  index  k  is  one  less  than  ours). 

Given  a  strictly  increasing  fuction  g  from  N  into  itself,  its  functional 
inverse  is  the  function  7  from  N  into  itself  given  by 

7(n)  =  min  {j  >  1  :  g(s)  ^  n}  ; 

thus,  -y(n)  =  s  if  and  only  if  g(s-l)  <  n  <  g{s).   In  particular,  let  a^  and  a 
denote  the  functional  inverses  of  A^  and  A,  respectively.   Then,  for  all  n  6  N, 


ai(")  = 


a2{n)  =   r  log  n]  . 
The  functions  a,,  are  easily  seen  to  satisfy  the  following  recursive  formula: 

ai(n)  =  min  {5  >  1  :  a|l)i(n)  =  1}  ; 
that  is,  oLi,{n)  is  the  number  of  iterations  of  a,,^i  needed  to  go  from  n  to  1.  In 


particular,  a^in)  is  precisely  log*n,  as  defined  in  Section  1. 

All  the  functions  ajt  are  non-decreasing,  and  converge  to  infinity  with 
their  argument.  The  same  holds  for  a  too,  which  grows  more  slowly  than  any 
of  the  a^.  Note  that  a(n)  <  4  for  all  n  <  A (4)  which  is  a  tower  with  65536 
2's,  thus  a{n)  <  4  for  all  practical  purposes. 

We  can  now  state  our  results. 
MAIN  THEOREM: 

Thus,  there  exist  constants  C^  ,  C2  >  0  such  that 

Cina(n)  <  X^i")  -  C2na{n) 

for  all  «  ^  1;  we  remark  that  the  constants  are  of  reasonable  magnitude  (very 
coarse  estimates  are  given  in  the  Remarks  at  the  end  of  Sections  5  and  6). 
Note  moreover  that,  in  comparison,  Szemeredi's  result  (for  X3)  can  be  stated 
as  X3(n)  <  Cna^in). 

The  Main  Theorem  is  a  consequence  of  the  following  Theorems,  the  first 
establishing  connections  between  Davenport-Schinzel  sequences  and 
compression  schemes,  and  the  other  two  yielding  (upper  and  lower)  bounds 
for  the  latter. 

Theorem  A:  For  all  n,  m  ^  1, 

X3(n)  <  H2n,n)  ,  (i) 

\\)(n,m)  <  X3(m)  +  (rz  +  m-l)  .  (ii) 


Theorem  B: 


Theorem  C: 


v|;(n,m)  =  0{(n  +  m)a(n)) 


^(2n,n)  =  n{na(n))  . 

Theorems  A,  B  and  C  will  be  proved  in  the  following  three  sections. 

Proof  of  Main  Theorem:  Theorem  B  (with  (2n,n)  for  {n,m))  together  with 
Theorem  A(i)  imply  X3(/i)  =  0{na(n)).  The  other  inequality 
X3(ai)  =  n(na(n))  follows  from  Theorem  C  and  Theorem  A(ii)  (with  (2n,n) 
for  {n,m)).    (Note  that  a(2n)  <  a{n)  +  1  for  all  n.)  Q.E.D. 
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The  inverse  of  Ackermann's  function  has  also  appeared  in  the  analysis  of 
the  union-find  algorithm  in  [Taj.  The  inverse  function  appearing  there  is,  in 
our  notations, 

a^(n)  =  min  {A:  >  1  :  At  +  i(4)  >  log  n} 

(cf.  [Ta,  p. 221],  for  m  =  n);  recall  that  our  function  is 

a{n)  =  min  {/:>!:  Ai,{k)  >  n}  . 

However,  the  two  functions  are  of  the  same  order  of  magnitude: 

Lemma  3.1:  For  all  n  >  1, 

|a(«)  <  a^in)  <  2a(«)  . 

To  show  this,  we  use  the  following 
Lemma  3.2:  For  all  it  >  1  and  s  >  3 

Proof:  It  is  easily  checked  that  Ai,(s)  <  A,,  +  i(s)  for  all  k,  s  >  1.  Then 

Ak^iis)  =■■  At(A,,  i(5-l))  >  A,(A2(5-1))  =  At(2--i)  >  A,is+1)  , 
for  all  f  >  3.  Q.E.D. 

Proof  of  Lemma  3.1:  K  «  <  16,  then  a{n)  <  3  and  a^(n)  =  1.    If  n  >  16, 
then  a(n)  >  4,  and  using  Lemma  3.2  we  obtain  for  k  =  a{n) 

logn<n^A,{k)^A2,-,i4), 

thus  a'' (n)  <  2a(n)-5. 

For  the  converse,  it  is  easily  checked  that  if  a^(n)  =  1  or  2,  then 
a(n)  <  4;  if  a^{n)  =  3,  then  a(n)  <  5.   And  finally,  if  ;t  =  a'^{n)  >  4,  then 

n  <  2^*-(^)  =  A2(A,^i(4))  <  A,(A,^i(4))  =  A,^i(5)  <  A,  +  i(/t+l)  , 
thus  a(n)  <  a^(n)+l.  Q.E.D. 

4.   Linear  equivalence  between  v};  and  X3 

In  this  section  we  prove  Theorem  A,  showing  that  the  two  functions  \\) 
and  X3  are  of  the  same  order  of  magnitude.  This  will  follow  by  using  two 
transformations,  from  DS{n,3)  sequences  to  compression  schemes  and  vice 
versa. 
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4.1.  Transforming  Davenport-Schinzel  sequences  into  compression 
schemes. 

Let  U  =  (ui,W2.  •  •  •  ."m)  be  a  DS(n,3)  sequence.  Define,  for  each 
1  =  1,...  ,/i, 

it,  =  min  {p  :  Mp  =  i}  , 

Vi  =  max  O  :  Up  =  /}  ; 

that  is,  ^L,  is  the  index  of  the  first  occurrence  of  i  in  U,  and  v,  is  the  index  of 
the  last  such  occurrence.  Without  loss  of  generality  (permuting  I,  ...  ,n  if 
necessary)  assume  that  ^lj  <  ^jl2  <    •  •  •    <  M-n- 

A  chain  c  =  (up,  .  .  .  ,u^)  is  a  maximal  decreasing  contiguous 
subsequence  of  U,  i.e. 

(Up_i  <  )  Wp  >  Up  +  i  >     •  •  •    >  U.J  (  <  u^  +  i)  . 

Chains  are  obviously  disjoint  and  their  union  is  U. 

Lemma  4.1:  Suppose  that  /  =  up-i  <  Wp  =  j.  Then  either  p  — 1  =  v^  or 
3  =  ^j- 

Proof:  Suppose  the  contrary;  then  -y  =  jiy  <  p  — 1  and  8  =  v/  >  |3.  Also, 
since  /  <  j  we  must  have  €  =  ji^  <  jiy  =  7.  Overall,  v/e  obtain  five  indices 
€<'Y<3  — 1<P<8  such  that  m,  =  Wp-i  =  u^  =  i  and  u^  =  u^  =  j,  a 
contradiction  which  completes  the  proof.  Q.E.D. 

Corollary  4.2:  There  are  at  most  2n  distinct  chains  in  U. 

Proof:  By  the  preceding  lemma,  a  chain  can  end  either  at  V/  or  at  p-/— 1,  for 
some  i  =  1,  .  .  .  ,n,  and  there  are  at  most  2n  such  places.  Q.E.D. 

Remark:  The  correct  upper  bound  on  the  number  of  chains  is  actually  2n— 1, 
since  m-i— 1  =  0;  it  is  attained,  for  example,  by  the  sequence  (of  length 
5;: -8) 

12131.  ..ln-1  ln-ln-2  .  .  .  3  2  n  2  n  3  n  .  .  .  nn-1  n  . 

Enumerate  the  chains  of  U  in  the  order  they  occur  as  Ci,C2.  .  .  .  ,Cp, 
where  p  <  2n.  Let  T  be  a  (p,n)-tree,  with  the  p  inner  nodes  \,  .  .  .  ,p 
corresponding  to  the  chains  c^,  .  .  .  .c^;  they  are  arranged  in  a  single  path, 
with  q+\  the  father  of  q,  for  q  =  \,  .  .  .  ,p  —  l\  the  n  leaves  l^,  .  .  .  ,l„ 
correspond  to  the  symbols  I,  .  .  .  ,n  appearing  in  the  sequence  U,  and  are 
attached  to  the  p  inner  nodes  as  follows.  For  each  /,  let  rj  <  ^2  <  •  •  •  <  t^. 
be  the  indices  of  the  chains  that  contain  /;  we  then  attach  the  leaf  /;  to  the 
node  ti  in  T.  Define  a  GPC  //  =  (li,ti,  .  .  .  ,t^)  on  T,  and  let 
F  =  (fi,  .  .  .  Jn)-  The  total  length  of  F  is 
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\F\=±\f,\=±q,=   ±    \cr\  =  \U\  . 


/=1  /=l  r=l 


Proposition  4.3:  The  sequence  F  is  a  compression  scheme  on  T. 

Proof:  For  each  i  =  I,  ...  ,n,  let  Si  be  the  index  of  the  chain  containing  the 
first  occurrence  of  i  in  f/  (i.e.  jji/  €  c^);  since  jx-i  <  jjl2  <  •  •  •  <  n^n  and  the 
chains  are  decre^ising  sequences,  we  have  si  <  S2  <  •  •  •  <  s„,  and  thus  the 
leaves  //  ~  o"(//),  whose  fathers  are  the  nodes  si,  are  indeed  in  postorder. 

Next,  suppose  to  the  contrary  that  F  is  not  executable,  and  let  f,  be  the 
first  GPC  in  F  which  cannot  be  executed.  Write  f,  =  (//,/i,  .  .  .  .t^)  and 
suppose  that  when  //  is  to  be  executed,  /^  is  no  longer  a  descendant  of  r^  +  i 
(note  that  the  leaf  /,  of  t^  has  not  yet  been  involved  in  any  GPC,  and  is  still  a 
son  of  t{).  Then  there  had  to  exist  a  GPC  fj  with  j  <  i  which  has  separated  r^ 
from  /t  +  i;  thus/y  contains  a  node  u  with  r^  <  u  <  /jt  +  i,  and  its  last  node  v  is 
such  that  r^  +  i  <  v.  Since  j  <  i,  we  also  have  Sj  <  Si  =  t^  ^  t/^.  Therefore 
j  6  Cj  ,  c„,  Cy,  and  i  €  c,^,  c,^^^,  implying  that  U  contains  a  forbidden 
subsequence  of  the  form  jijij,  contrary  to  assumption  (note  that  this 
occurs  even  If  t^  =  u,  because  in  the  decreasing  chain  c„  the  symbol  i  must 
precede;).  Q.E.D. 

Proof  of  Theorem  A(i):  The  transformation  described  above  yields 
,\3(n)  <  ^(p,n)  :<  \\i{2n,n),  since  ^  is  monotone  and p  <  2n.  Q.E.D. 

4.2.  Transforming  compression  schemes  into  Davenport-Schinzel 
sequences. 

Let  r  be  an  (/i,m)-tree,  and  let  F  =  C/"i/2.  •  •  •  >fm)  be  a  compression 
scheme  on  T.  We  will  identify  each  GPC  with  the  set  of  all  vertices  of  T 
through  which  it  passes,  except  for  its  last  vertex.  Specifically,  for  each 
fj  ~  (^1'  •  •  •  >Xk-i>^k)y  Jst/^  =  {x^,  .  .  .  ,xt-i}.  Enumerate  the  n  +  m  nodes 
of  T  in  the  given  postorder  as  1,2,  .  .  .  ,n  +  m,  and  define,  for  each 
v=l,  ...,n  +  m,a  sequence 


U^=  I  y  :  1  <  ;•  <  m    and  v  e  /J'  1  , 


where  the  elements  of  each  sequence  U^  are  arranged  in  decreasing  order. 
Let  U  be  the  concatenation 

U=  U,\\U2\\    ■  •  •  11  u„^„  . 

To  obtain  a  Davenport-Schinzel  sequence  V  from  U,  we  proceed  through  the 
subsequences  U^  in  order,  erasing  the  first  element  of  U^  whenever  it  equals 
the  preceding  non-erased  element  of  U;  in  total,  at  most  n  +  m -I  elements 
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are  erased. 

Proposition  4.4:  The  sequence  V  is  a  D5(m,3)  sequence  of  length 

|V|>  lF|  -  (n  +  m-l)  . 

Proof:  V  is  composed  of  m  symbols;  the  erasing  procedure  and  the  fact  that 
each  sequence  U^  contains  distinct  elements  imply  that  no  two  consecutive 
elements  of  V  are  equal.  For  each/j,  €  F,  its  index  j  appears  \fj\  times  in  U 
(the  last  vertex  of  fj  is  ignored),  thus  \U\  =  [F|  and  Ivj  satisfies  the  required 
inequality. 

Finally,  U  -  and  thus  V  too  -  does  not  contain  a  subsequence  of  the  form 
i  j  i  j  i.  Suppose  the  contrary;  then  there  exist  vertices* 

in  T,  and  two  distinct  GPC's/,,/y  i  F  such  that 

j  ^U^,U,, 

and  moreover  these  five  occurrences  of  i  and  j  appear  in  U  in  the  order 
/  j  i  j  i.  Let  us  denote  this  illegal  subsequence  as  Q. 

Without  loss  of  generality,  we  can  assume  that  Q  h  the 
(lexicographically)  leftmost  occurrence  of  such  an  illegal  subsequence  in  U. 
In  particular,  no  j  appears  in  U  before  the  occurrence  of  j  in  U^  (for 
otherwise  we  would  have  an  illegal  subsequence  of  the  form  j  i  j  i  j  lying  left 
of  Q).  Moreover,  we  can  assume  for  similar  reasons  that  U^  contains  the 
first  occurrence  of  i  in  U  and  that  U^  contains  the  first  occurrence  of  j  in  U. 
By  definition  we  must  have  p  =  ^ifi),  y  =  ^(fj),  so  that  ^  and  y  are  distinct 
leaves  of  T.  Thus  ^  <  y  and  so  i  <  j  since  the  sequence  F  is  postordered. 

Furthermore  we  also  have  8  =?t  g^  for  otherwise  f/j  would  contain  both  i 
and  j,  and  j  would  have  to  precede  /  in  C/j  (since  i  <  j).  But  then  our 
subsequence  Q  would  appear  in  U  in  the  order  /  j  j  i  j ,  contrary  to 
assumption.  Moreover,  7  is  a  leaf  whereas  5  is  not  (f,  starts  at  3  and  then 
passes  through  8),  so  that  we  also  have  7  =s^  8. 

Now  3  <  7  <  8  and  p  is  a  proper  descendant  of  8  (f,  contains  both  of 
them).  Since  the  vertices  of  T  are  arranged  in  postorder,  7  must  also  be  a 
proper  descendant  of  8.  By  the  same  argument,  8  must  be  a  proper 
descendant  of  e  and  €  must  be  a  (possibly  improper)  descendant  of  G. 


All  inequalities  between  vertices  refer  to  the  given  postorder. 
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Let  4  be  the  least  common  ancestor  of  3  and  7  in  T;  then  ^  is  a 
descendant  of  5.  Since  p  =?t  7  are  leaves,  4  is  not  a  leaf,  and  so  t,  i^  p,  7.  Let 
^  be  the  son  of  I  on  the  path  from  7  to  ^;  see  Figure  4.1. 

Since  3  <  7  we  also  have  3  <  C.  and  in  fact  every  leaf  of  the  subtree  T^ 
of  T  rooted  at  i  succeeds  p  in  postorder.  Since  the  GPC's  of  F  are  executed 
in  postorder,  it  follows  that  by  the  time/,  is  executed,  no  GPC  has  yet  passed 
through  any  vertex  of  T^\  in  particular  7  is  still  a  descendant  of  I,  at  that  time. 

Let  T)  be  the  first  vertex  (i.e.  furthest  from  the  root  of  T)  in/,  which  is  a 
(possibly  improper)  ancestor  of  4-  Note  that  since  h  ^  f,,  r\  must  lie  on  the 
path  between  t,  and  5  (including  ^,5). 

After//  is  executed,  7  remains  a  descendant  of  I,  and  thus  of  t]  too.  Let 
p  be  the  last  node  of//  (note  that  9  <  p).  Execution  of//  has  made  t]  a  son 
of  p.  Now  e  lied  originally  strictly  between  t|  and  p  (strictly  because  8  ?^  e 
and  9  i=  p).  Thus,  after  execution  of  /,  t),  and  hence  a  fortiori  7,  will  no 
longer  be  a  descendant  of  e.  But  this  implies  that  fj  cannot  be  executed,  a 
contradiction  which  completer  the  proof.  Q.E.D. 

Proof  of  Theorem  A(ii):  The  above  transformation  plainly  gives  the  required 
mequality.   Q.E.D. 

5.  The  upper  bounds 

In  this  section  we  prove  Theorem  B,  i.e.  establish  an  upper  bound  on  \\} 
(and  hence  also  on  X3).  To  obtain  this  bound  we  first  derive  a  recurrence 
relation  for  the  function  4/. 

Proposition  5.1:  Let  n,  m  ^1,  and  let  t  >  1  be  a  divisor  of  n.  Then  there 
exist  integers  m* ,  m-^,  rtii,  .  .  .  ,m^  >  0  such  that 

b 

m    +  ^  m/  =  m  , 
/=i 

and 

^{n,m)  <  ^{b-\,m*)  +  2/i  +  2m*  +  f  vl;(|-,m/)  .  (*) 

Proof:  It  suffices  to  consider  only  the  extreme  case  of  (n,m)-trees  all  of 
whose  inner  nodes  lie  along  a  single  path;  we  will  refer  to  such  a  tree  as 
being  canonical.  Indeed,  given  any  (M,m)-tree  T,  transform  it  into  a  canonical 
(n,m)-trec  Tq  by  arranging  its  inner  nodes  along  a  single  path,  according  to 
their  given  postorder  (the  leaves  remain  attached  to  the  same  nodes).  It  is 
easily  seen  that  any  compression  scheme  on  T  is  also  a  scheme  on  Tq. 
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Let  then  T  be  any  canonical  (n,m)-tree,  and  let  6  >  1  be  a  divisor  of  n. 
Partition  the  tree  T  into  b  layers  L-^.  .  .  .  ,Lh  as  follows:  Let  1,  .  .  .  ,n  denote 
the  inner  nodes  of  T  in  postorder  (n  is  the  root),  then  the  layer  L,  consists  of 

thep  =  -^  inner  nodes  (z-l)p  +  l,  {i-\)p  +  2,  .  .  .  Jp,  together  with  all  the 
b 

leaves  attached  to  them.  Let  F  be  any  compression  scheme  of  m  GPC's 
starting  at  the  leaves  of  T.  Qassify  the  GPC's  of  F  into  the  following  two 
types. 

(I)  GPC's  that  start  and  end  in  the  same  layer.  For  each  i  =  I,  .  .  .  ,b,\ct  mi 
denote  the  number  of  GPC's  that  start  and  end  in  layer  L,.  The  total  length 
of  these  GPC's  is  at  most 

i   Hf,rni)  .  (1) 

(II)  GPC's  that  contain  vertices  of  more  than  one  layer.  I^t  m*  denote  the 
number  of  GPC's  of  this  type;  plainly,  m*  +  ^  nii  —  m.   Tlie  tocal  length  of 

these  GPC's  is  the  total  number  of  their  edges;  call  an  edge  (x,^,  ^.l)  internal 
if  Xj  and  xj  +  i  belong  to  the  same  layer,  and  crossing  otherwise.  The  layer  L 
to  which  (T(f)  belongs  will  be  referred  to  as  the  starting  layer  of  the  GPC/. 

The  total  length  of  the  GPC's  of  type  II  can  thus  be  written  as 
Ni  +  N2  +  N^,  where: 

Ni  is  the  number  of  internal  edges  within  the  starting  layer  of  their  GPC. 
A^2  is  the  number  of  internal  edges  within  non-starting  layers  of  their  GPC. 
A^3  is  the  number  of  crossing  edges. 

To  estimate  N^  we  argue  as  follows.  Fix  a  layer  L,,  for  i  =  1,  .  .  .  ,b—l, 
and  let  Q,  denote  the  set  of  all  inner  nodes  of  L,  whose  fathers  do  not  belong 

to  Li.  Initially  \Qi\  =  1,  and  we  clearly  always  have  \Qi\  <  —.    Consider  any 

GPC/  which  starts  in  L/,  contains  kj^  =  k'^  >:  1  internal  edges  within  L,,  and 
then  crosses  over  to  another  layer.  Then  /  passes  through  kf  inner  nodes  of 
L/,  all  of  which  become  members  of  Qi  after  execution  of/.  Note  that  only 
the  last  node  of  /  in  L;  could  already  belong  to  Qi  before  execution  of  /;  all 
the  other  kj-  —  I  nodes  certainly  did  not.  Hence  execution  of/  adds  at  least 
kf  —  I  new  nodes  to  Qi,  so  that 

2:  (^/  -  1)  ^  f  -  1  , 

where  summation  is  over  all  fs  of  type  II  as  above.  Let  qi  be  the  number  of 
such  GPC's.   Then  their  total  contribution  to  N^  is 


-14- 


n 


E  ^/  =  ^/  +  S  (^/  -  1)  ^  9/  +  T  -  1  • 
/  / 

fr-i 
Hence,  summing  over  all  layers,  and  observing  that  2)  Qt  ^  rn* ,yNQ  obtain 

/  =  i 

N,  <  2:'  (^,  +  ^  -  1)  ^  m*  +  n  .  (2) 

For  the  estimation  of  N2  we  need  the  following  lemma,  which  establishes 
an  important  property  of  compression  schemes. 

Lemma  5.2:  Let  F  be  a  compression  scheme  on  a  tree  T,  and  let  u  and  v  be 
two  nodes  in  some/  6  F.  Let  w  be  a  node  such  that  u  <  w  <  v  in  the  given 
postorder  (thus,  in  particular,  w  was  initially  a  descendant  of  v).  Then  at  the 
time /is  executed,  w  is  still  a  descendant  of  v. 

Proof:  Suppose  the  contrary,  and  assume  that  /  is  the  first  GPC  in  F  for 
which  this  property  fails  to  hold  for  some  m,  v  €  /.  Then,  arguing  as  in  the 
proof  of  Proposition  4.3,  there  had  to  exist  another  GPC  g  preceding  /  in  F 
which  has  separated  w  and  v;  i.e.  g  contains  a  vertex  z  such  that,  just  before 
executing  g,  w  has  been  a  (possibly  improper)  descendant  of  z,  z  has  been  a 
proper  descendant  of  v,  and  v  has  been  a  proper  descendant  of  the  last  node  r 
of  g.  Then  u{g)  <  u{f)  <  u  <  w  ^  z.  Since  o-(^)  has  been  originally  a 
descendant  of  z,  by  postorder  so  was  o{f).  Let  a,  b  b&  the  two  consecutive 
nodes  of  g  satisfying  a^g)  ^  a  <  aif)  -^  b  ^  z  (they  exist  because 
fj{g),  z  t  g).  Since  the  assertion  of  the  lemma  is  assumed  to  hold  for  all 
GPC's  in  F  preceding  /,  the  node  o-(/)  must  still  be  a  descendant  of  b  just 
before  executing  g.  But  at  that  time  b  is  a  (possibly  improper)  descendant  of 
z  (because  b,  z  €  g).  Hence,  immediately  after  g  is  executed,  b,  and  hence 
(j(f)  too,  are  disconnected  from  v  {b  becomes  a  son  of  r).  But  this  makes  / 
unexecutable,  a  contradiction  which  completes  the  proof.  Q.E.D. 

The  estimation  of  N2  now  proceeds  as  follows.  We  fix  a  layer  L,,  for 
i  =  2,  .  .  .  ,b,  and  bound  the  contribution  to  A^2  ^^^  to  internal  edges  within 
Li.    Let  P,  be  the  set  of  inner  nodes  of  L,  lying  on  the  path  from  the  first 

inner  node  w  of  L,  to  the  root  of  T.    Initially  [P,]  =  -^,  and  P.  always 

b 

contains  at  least  one  node,  namely  w.  Consider  a  GPC/  with  a  crossing  edge 
(ii,v)  entering  L^  and  with  I  =  If  >  \  internal  edges  in  L,.  Let 
V  =  vq,  vj,  .  .  .  ,vi  denote  the  nodes  of/  in  L/.  It  follows  from  Lemma  5.2 
that,  just  before  /  is  executed,  w  is  still  a  descendant  of  vq,  so  that  P, 
currently  contains  all  the  nodes  vq,  .  .  .  ,v,.  After  execution  of  /  all  the  nodes 
vj,  .  .  .  ,v;_i  will  be  deleted  from  P^;  if  /  terminates  in  another  layer 
(succeeding  Li),  then  v,  will  also  be  deleted  from  P^.  Hence  \P,\  is  decreased 
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after  executing  /  by  I*  nodes,  where  //  is  /^  -  1  if  /  terminates  in  L,,  and  is  /y 
otherwise.  Therefore 

where  the  summation  is  over  all  fs  of  this  sort.  Let  p,  denote  the  number  of 
such  GPC's  which  terminate  in  L^ .  Then  the  total  contribution  to  N2  within  L, 
is 

f  f  '^ 

Hence,  adding  these  bounds  for  i  =  2,  .  .  .  ,^,  we  obtain 

A^2^  i  (P/+  ^-  1)^^*-^  ".  (3) 

b 

since  2  P/  —  '"*• 

/  =  2 

Finally,  for  A^3  we  use  the  following  construction: 

Let  T*  be  a  canonical  tree  of  the  following  form.  It  has  b—\  inner  nodes 

y2<  '  •  •  <  Jt,  each  y,  representing  the  layer  L,;  in  addition,  each  y,  has  /, 
leaves  attached  to  it,  where  //  is  the  number  of  GPC's  of  type  11  in  F  whose 
starting  layer  is  L,_i.  To  simplify  noiaiion,  we  will  identify  the  /,  leaves  of  v, 
with  the  starting  leaves  of  these  GPC's.  For  each/j  of  type  II,  let  gj  be  the 
sequence  of  layers  through  which /^  parses,  without  repetitions  (i.e.  each  such 
layer  appears  in  gj  just  once): 

8j  =  Oi.  '2.  •  •  •  .'r) 

(where  1  <  /^  <  ij  <  ••  •  <  /^  <  t).  We  transform  gj  into  a  GPC  //  on 
T*: 

//  =  i^j^yi. yO 

where  Zj  =  cr(fj)  (which  is  a  leaf  of  L/^  in  T).  Note  that  y,  does  not  appear  in 
f*,  whose  length  {[*{  =  r—  1  precisely  equals  the  number  of  crossing  edges  of 
fj- 

Lemma  5.3:  The  resulting  sequence  F*  =  (ff)  is  a  compression  scheme  on 
T*. 

Proof:  First  observe  that  F*  is  postordered  relative  to  the  postorder  on  T*  in 
which  all  auxiliary  leaves  of  any  y,  succeed  yi-i-  Second,  suppose  to  the 
contrary  that  F*  is  not  executable,  and  let/*  be  the  first  nonexecutable  GPC 
inF*: 
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with  z  a  leaf  of  L,^  in  T.  We  must  have  one  of  the  following  two  situations. 

(i)  yi  is  not  a  descendant  of  yi^^^  for  some  2  <  s  <^  r-\.  Then  there  had  to 
exist  a  GPC  h*  preceding  /*,  which  contained  two  inner  nodes  yp  and  y^  of 
T*,  where 

is-P<  h+\  <  Q  • 
Thus  the  origin.?]  GPC  /  contains  some  nodes  a  i  Li^,  b  i  L,^^^,  and  h 
contained  some  nodes  c  i  Lp,  d  ^  L^.  K  a  ^  c,  then 
ct(/i)  <  cr(/)  =  z<a:^c<b<d;  by  Lemma  5.2,  a  and  b  are  still 
descendants  of  c  and  d,  respectively,  when  h  is  executed,  after  which  they 
certainly  become  disconnected.  But  this  makes  /  unexecutable,  so  we  must 
have  c  <  a,  and  thus  p  =  i^.  Moreover,  c  <  z  <  a,  otherwise  z  and  a 
would  be  disconnected  by  h  (by  the  same  argument  as  above) .  Thus  /  starts 
in  L/  ,  hence  s  —  I,  again  a  contradiction. 

(ii)  z  is  not  a  descendant  of  y,^.  Since  no  other  GPC  starts  at  z,  it  follows 
that  J;  +1,  to  which  z  is  attached,  is  not  a  descendant  of  yi^.  Therefore  there 
had  to  exist  a  previous  h*  which  contained  two  inner  nodes  y^  and  y^  of  T* , 
where 

1*1+1  ^  p  <  12  <  q  . 

Hence  h  contained  some  c  i  Lp,  d  i  L^.  But  then  ct(/i)  <  2  <  c  and  h  would 
have   disconnected   z    from   L/^,    so    that  /  would    not   be   executable,    a 
contradiction  which  completes  the  proof.  Q.E.D. 
Corollary  5,4: 

N^^^{b-\,m*).  (4) 

Proof:  Immediate  from  the  preceding  lemma.  Q.E.D. 

Combining  the  estimates  (1)  -  (4),  we  finally  obtain  the  desired 
recurrence  relation  (*),  and  thus  complete  the  proof  of  Proposition  5.1. 
Q.E.D. 

Proposition  5.5:  Let  m,  s  '^\,  k  >  2,  and  suppose  that  n  divides  A,^{s).   ITien 
^\){n,m)  <  {lk-2)ns  +  {2k-\)m  .  .  (**) 

Proof:  We  will  use  (*)  repeatedly  to  obtain  the  series  of  upper  bounds  on  4>, 

stated  in  (**)  for  ^  =  2,3 At  each  step  we  choose  i>  in  an  appropriate 

manner,  and  estimate  ^{b  —  \,m*)  using  the  bound  obtained  in  the  preceding 
step.  This  yields  a  recurrence  relation  on  ^  which  we  solve  to  obtain  a  better 
upper  bound  on  \\). 
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Specifically,  we  proceed  by  double  induction  on  k  and  s.  To  start  this 
iterative  process  with  k  =  2,  suppose  first  that  n  =  A2{s)  =  2\  Choose 
b  =  2  in  (*);  it  is  easily  checked  that  \\i{b-l,m*)  =  ili(l,m*)  =  m*  for  all 
m*,  so  that  (*)  yields 

>\){n,m)  <  2/1  +  3m*  -f  ii;(y,mi)  +  i|i(y,m2)  . 

The    solution    to    this    recurrence    relation,    for    n    a    power    of    2    and 
m  =  m'  +  m-i  +  m2  arbitrary,  is 

\\)(n,m)  :S  2n  log  n  +  3m  . 

(This  is  obvious  for  «  =  1  because  \\}(l,m)  =  m.  Assuming  it  holds  for  — , 
we  obtain 

\\)(n,m)  <  2n  +  3m*  +  4ylog  —  +  3mi  +  3m2  =  2n  log  n  +  3(m*  +  m^  +  m2) 

=  2n  log  n  +  3m  , 
thus  the  claim  follows  by  induction.) 

To  complete  the  argument  for  it  =  2,  note  that  if  n  divides  A 2(5')  -  2' 
then  Ai  is  a  power  of  2  and  log  n  -^  s,  thus 

v|;(n,m)  ^  2n  log  n  +  3m  <  2ns  +  3m  . 

In  particular,  we  have  for  w  =  1,  2 

i|;(n,m)  <  2/1  +  3m  <  (2it-2)/i  +  (2it-l)m 

for  all  it  >  2.  Since  A^iX)  =  2  it  follows  that  (**)  holds  for  each  it  >  2  and 
s  =  \. 

Suppose  next  that  k>  2  and  i'  >  1,  and  that  the  induction  hypothesis  is 
true  for  all  it'  <  ^  and  s'  ^1,  and  for  it'  =  it  and  all  5'  <  s.  Observe  that 
A^{s  —  \)  is  a  divisor  of  At(j)  because  they  are  both  powers  of  2.    Assume 

first  that  n  =  A^{s);  let  t  =  A^{s-i)  and  choose  b  =  —,  which  is  an  integer 

dividing  n  =  Ai,{s)  =  A;t_i(r).    Hence  by  induction  hypothesis  (for  k-l  and 
t)  we  have 

i^(b-l,m*)  <  \\>(b,m*)  <  (2k-4)bt  +  {2k-3)m*  =  (2k-4)n  +  (2k-3)m*  . 

Then  (*)  becomes 

y\){n,m)  <  {2k-4)n  +  {2k-3)m*  +  2/j  +  2m*  +  ^  v|;(r,m,)  . 

Using  the  induction  hypothesis  once  more  (for  k  and  s-\),  we  obtain 

^\>(n,m)  <  {2k-2)n  +  (2/:-l)m*  +   2)   ii2k-2)lis-\)  +  (2A-l)m,)  = 

1-  1 
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=  (2k-2)ns  +  (2k-l){ni*  +  ^  m,)  =  (2k-2)ns  +  (2k-l)m  , 

/  =  i 

b 

because  m*  +  ^  rtii  =  m. 
/=i 
Finally,  assume  n  divides  Af,{s),  say  At,{s)  =  pn.  Let  T  be  a  canonical 
(n,m)-tree  and  F  a  compression  scheme  on  T  of  length  >^{n,m).  Takep  copies 
of  T,  and  construct  a  canonical  (p/i,/?m)-tree  T*  by  concatenating  them  in 
sequence.  Let  F*  be  the  concatenation  of  the  corresponding  p  copies  of  F. 
Since  |F*|  =  p[F]  =  ;?4.<(«,m),  we  obtain 

pi)>(M,m)  <  »l;(pn,pm)  =  il;(At(j),pm)  <  {2k-2)sA^{s)  +  (2/:-l)pm  , 

which,  divided  by  p,  yields  the  required  inequality.    This  completes  the 

proof.   Q.E.D. 

Corollary  5.6:  For  all  n,  m  >  1  and  k  ^  2, 

^\^{n,m)  <  (4k-4)nat{n)  +  {2k-l)m  , 
where  a^  is  the  functional  inverse  of  A^  as  defined  in  Section  3. 

Ms) 
Proof:  Put  s  =  a^in),  so  that  A;,{s-1)  <  n  <  A,,{s).  Let  p  =     >  1, 

then  Ai:(s)  <  (p  +  i)n  <  2pn.  As  in  the  preceding  proof  we  have 

pt^{n,m)  <  \\)(pn,pm)  <  \\f(Ai,{s),pm)  <  (2k-2)sAi,(s)  +  (2k-l)pm 

<  (4k-4)pns  +  {2k-l)pm  . 

Dividing  by  p  we  obtain  the  desired  inequality.  Q.E.D. 

Proof  of  Theorem  B:  We  claim  that 

}\}{n,m)  <  {24n  +  4m)a(n)  (B) 

for  all  n,  m  >1.  Indeed,  assume  first  n  >  4  so  that  a(n)  >  3.  Let 
k  =  2a(/i)  -  3,  then  n  <  Aa(„)(a(n))  <  At(3)  by  a  repeated  application  of 
Lemma  3.2,  hence  ajk(n)  <  3.   Corollary  5.6  yields 

\\i(n,m)  <  (4)t-4)-3n  +  (2it-l)m  <  24na(n)  +  4ma(n)  , 

as  required.  For  n  <  4  the  desired  inequality  can  be  verified  directly  from 
Corollary  5.6.  Q.E.D. 

Remark:  Inequality  (B)  and  Theorem  A(i)  imply 

Xiin)  <  ^\)(2n,n)  <  (48/i  +  4Ai)a(2^i)  <  52n(a(n)+l)  . 
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6.  The  lower  bounds 

In  this  section  we  will  establish  nonlinear  lower  bounds  for  ij;  (and,  a 
fortiori,  also  for  X^  for  all  j  >  3)  which  match  closely  the  upper  bounds  just 
obtained.  This  is  achieved  using  a  construction  similar  to  that  of  [Ta].  In  this 
derivation  we  use  a  sequence  of  functions  fi^  which  are  similar  to  the 
functions  A^,  but  which  are  easier  to  use  in  our  construction. 

6.1.  The  functions  5^  and  their  properties. 

Define  inductively  a  sequence  {fl^}"=i  of  functions  from  the  set 
No  =  NU{0}  into  itself  as  follows. 

B^{s)  =  0         ,  s>0  , 

5,(0)  =1         ,  k^l, 

B,{s)  =  B,is-1)  +  5,_i(2^*(^-^))       ,  ^  >  2,  5  -  1  . 

Here  are  some  properties  of  the  functions  B^ . 
(Bl)        B2is)  =  1,  for  5  >  0. 
(B2)       B^is)  =  J  +  1,  for  J  >  0. 

(B3)        B^{s)  >  2^     ,  with  s  2's  in  the  exponential  tower,  for  j  >  0;  i.e. 
B,is)  >  A,is). 

Indeed,    ^4(0)  =  1,    and    ^4(5)  =  ^4(^-1)  +  2*^^^"^^  +  1  >  2^*'^  '^^    for 
5  >  1. 

(B4)  Each  function  Bi^{s)  is  strictly  increasing  in  j,  for  all  ik  >:  3,  and  each 
sequence  {Bi^{s)}^s,\  is  strictly  increasing,  for  all  5^  >  1. 

To  see  this  note  first  that  B,,{s)  >  1  for  all  yt  >  2  and  s  >  0.    This  implies, 
for  Jt  >  3  and  5  >  0,  that 

B,{s+1)  =  B,{s)  +  B,-i(2**(^))  >  B,{s)  +  1  . 

hence  also  Bi,{s)  >  s+ 1.  Finally,  we  obtain 

fi,  +  i(5)  >  B,(s)  +  1  ,       k^l,  s^l. 

(This  is  clear  for  yt  =  1.  For  it  >  1  we  have  2**"'^'"^^  >2'^s,  hence 

B,,,is)  =  B,^,is-1)  +  5,(2^-'(^-^))  ^  1  +  B,{s)  .    ) 

(B5)        B,,{s)  >  A,_i(.y),  for  it  >  4,  ^  >  1. 

This  is  proven  by  double  induction  on  k  and  s.  (B5)  holds  for  /:  =  4  by  (B3), 
and  holds  for  j'  =  1  since 

5,(1)-  1  +  5,_i(2)>2  =  A,_i(l). 
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For  it  >  4  and  s  >  1,  assume  (B5)  to  hold  for  all  4  <  A:'  <  k  and  s'  ^  1,  and 
for  k'  =  k  and  all  1  <  j'  <  ^.  Then  we  have 

2"*(^-')>  5^(5-1)  >A,_i(5-l), 

hence 

Bk{s)  ^  fi,-i(2**(^-'))  ^  5,-i(A,_i(5-l))  >  A,_2(A,_i(5-l))  =  A,_i(5)  . 

(B6)        2**^'^  ^  ^  <  Ai(5  +  3),  for  it  >  1,  5  >  0. 

This  is  again  proven  by  double  induction  on  k  and  s.    For  ^  =  1  we  have 

2S1W  +  2^4  andAi(5  +  3)  =  2(^  +  3)  >  4. 

For  Jt  -  2,  2*^-^'^  ^  ^  -  8  and  A2(^  +  3)  =  2^  +  ^  >  8. 
For  it  >  3  zjid  s  =  Q,  we  have  2^'^°^  ^  ^  =  8  and 

At(3)  =  A,_,(A,(2))  =  A,_i(4)  >  Ai(4)  =  8  . 

Finally,  for  it  2r  3  and  s  '^\,  put  /  =  5^(5  —  1);  then 

fiu.)  -  r  +  fi,_i(20  <  3(2'  -  1)  +  fi,_i(2') 
which,  by  (B4),  is 

<:  i?i_,(2'  +  3(2'  -  1))  =  B,_i(2'^2  _  3). 
Thus 

^BM  +  2  <  2**-i(^'*^  "  ^)  "^  ^  <  A  _  (2'+2) 
=  A,_i(2''*(^-^)  ^  2)  <  A,.i(A,(.  +  2))  =  A,(5  +  3), 
using  the  induction  hypothesis  twice. 

Note,  as  a  corollary,  that 
(B7)        A^_^{s)  <  By{s)  <  Ajk(5  +  3),  for  it  >  4,  j  >  1, 

so  that  the  sequences  of  functions  B^^  and  A^  are  indeed  of  the  same  order  of 
magnitude. 

6.2.   The  derivation  of  the  lower  bounds 

Although  the  proof  of  the  upper  bounds  in  the  preceding  section  has 
dealt  exclusively  with  canonical  (r2,m)-trees,  we  will  find  it  more  convenient 
in  this  section  to  work  with  (essentially)  symmetric  binary  trees  rather  than 
with  single-path  ones  (of  course,  as  noted  in  Section  5,  any  compression 
scheme  on  an  arbitrary  (n,m)-tree  can  be  translated  to  the  corresponding 
canonical  tree). 

Let  r  be  an  arbitrary  (/i,m)-tree.  Define  a  sequence  of  trees  T{i),  for 
/  >  0,  as  follows:  7(0)  =  7;  to  construct  r(/+l),  take  two  disjoint  copies  of 
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r(i),  introduce  a  new  node  r  as  the  root  of  T{i+\),  and  make  the  roots  of  the 
two  copies  of  7'(/)  sons  of  r  (see  figure  6.1).  Note  that  T{i)  is  an  {ni,m,)- 
tree,  where 

n,  =  {n+i)2'  -  1  , 

nil  =  m-2'  . 

Moreover,  for  any  given  postorder  on  T,  we  obtain  inductively  an  induced 
postorder  on  T(i+  1)  by  taking  in  each  of  the  two  copies  of  T{i)  the  san-ie 
(induced)  postorder.  In  the  sequel,  we  always  consider  only  such  induced 
postorders  on  the  trees  T{i). 

Theorem  6.1:  Let  7  be  a  (l,m)-tree  with  m  >  1,  and  let  k  >  1.  Then  for 
each  i  >  ^^(m)  there  exists  a  compression  scheme  F  on  T(i)  such  that  each  of 
the  m,  GPC's  in  F  is  of  length  k. 

Proof:  We  use  an  argument  similar  to  that  in  [Ta,  Theorem  15],  based  on 
double  induction  on  k  and  m.  Note  that  if  the  assertion  of  the  Theorem  holds 
for  some  i,  it  also  holds  for  all  j  >  i;  it  thus  suffices  to  prove  it  for 
I  =  Bi^{m).  Moreover,  we  remark  that  the  Theorem  holds  for  (/i,m)-trees 
with  arbitrary  n  ^  1  as  well;  to  reduce  it  to  the  case  n  =  1,  ignore  the  inner 
structure  of  the  tree,  and  assume  that  each  leaf  is  connected  directly  to  the 
root. 

Suppose  first  that  *  =  1.  Since  fii(m)  =  0,  it  suffices  to  prove  the 
assertion  for  T  itself.  Indeed,  take  F  to  be  the  sequence  of  GPC's 
((^1,^).  •  •  •  .(^m.O).  where  /j,  .  .  .  ,/„  are  the  leaves  of  T  arranged  in 
postorder,  and  r  is  the  root  of  T. 

Next  suppose  that  k  =  2  (this  step  is  not  essential  to  our  inductive  proof, 
but  is  given  anyway  to  prepare  for  the  following  more  complex  steps). 
Consider  7(1)  =  7(82(5)),  and  let  Tq  denote  its  root.  For  each  leaf  /  of  T(l) 
take  the  GPC  (/,r,ro)  of  length  2,  where  r  is  the  root  of  the  copy  of  T 
containing  /,  and  let  F  be  the  postordered  sequence  of  these  GPC's. 

Now  suppose  that  k  ^  3.  Assume  that  the  assertion  holds  for  all  it'  <  it 
and  all  m'  >  1,  and  consider  first  the  case  m  =  1.  T  consists  of  just  one  leaf  / 
and  a  root  r  (note  that  I  i^  r).  Let  T*  be  the  subtree  of  7(1)  obtained  by 
removing  its  (two)  leaves;  7*  is  a  (l,2)-tree.  Let  /q  =  Bi,_^{2)  and  i  =  ^^^(l); 
then,  by  definition,  i  =  1  +  Iq.  By  induction  hypothesis,  there  exists  a 
compression  scheme  F*  on  7*(/o),  all  of  whose  GPC's  are  of  length  it— 1. 
Now  7*(/o)  may  be  regarded  as  a  subtree  of  (7(l))(/o)  =  7(/),  and  we  will 

extend  each  GPC  /*  =  {x^,  .  .  .  ,X),)  ^  F*  to  a  GPC  /  =  {xq,x^ Xj,)  in 

7(/),  by  adding  to/*  the  leaf  Xq  which  is  the  unique  son  of  .v^.  The  resulting 
sequence  F  is  clearly  postordered  and  executable  in  7(/),  and  each  of  its 
GPC's  has  length  k,  thus  F  is  the  required  compression  scheme. 
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Finally,  let  Jk  >  3  and  m  >  1,  and  assume  that  the  assertion  holds  for  all 
k'  <  k  and  m'  >  1,  and  for  k'  =  k  and  all  m'  <  m.  Let  T  be  a  (l,m)-tree, 
and  let  li,  .  .  .  ,l„  be  its  leaves  arranged  in  postorder.  Let  T*  be  the 
(l,m-l)-tree  obtained  from  T  by  removing  its  last  leaf  l„.  Put 
I'o  =  5t(/?i  -1),  ii  -■=  Bt_i(2'°),  and  i  =  B^{ni)\  thus  /  =  Jq  +  'i- 

Consider  7'(io)  2uid  its  subtree  r*(io)-  By  induction  hypothesis,  there 
exists  a  compression  scheme  on  r*(jo),  all  of  whose  GPC's  have  length  k. 
llius,  starting  at  each  leaf  /j,  .  .  .  ,/„_i  in  each  copy  of  T  in  TQq),  we  have  a 
GPC  of  length  k.  Now  T{i)  =  (r(zo))(/i)  contains  2''  copies  of  T{io),  and  in 
each  of  these  copies  we  have  a  corresponding  compression  scheme  of  the 
above  form.  Refer  to  the  GPC's  in  all  these  schemes  as  GPC's  of  type  I. 

Next  consider  the  subtree  T**  of  /'(/q),  obtained  by  removing  all  its  m-2  ° 
leaves.  Since  T  is  a  (l,m)-tree,  the  leaves  of  T**  are  precisely  the  p  =  2'° 
copies  (in  TQq))  of  the  root  of  T.  Regard  T**  as  a  (l,/7)-tree  by  ignoring  its 
inner  structure,  and  apply  to  it  the  induction  hypothesis  (recall  the  note  at  the 
beginning  of  this  proof).  We  obtain  a  compression  scheme  F**  on  T**'(ii), 
consisting  of  GPC's  of  length  k—1  each,  such  that  only  the  root  and  the 
leaves  of  each  copy  of  T**  (but  none  of  its  intermediate  nodes)  participate  in 
F**.  Now  r**(ii)  is  a  subtree  of  (r(jo))(ii)  =  T(i);  we  extend  each  GPC 
/**  =  (j:i,  .  .  .  ,xi,)  i  F**  to  a  GPC  /  =  (xq^^,  .  .  .  .x,,)  in  T{i)  as  follows: 
Xi  is  the  root  of  some  copy  of  T,  and  we  take  Xq  to  be  the  last  leaf  there  (i.e. 
the  copy  of  /„).  We  will  refer  to  these  GPC's  as  being  of  type  II. 

We  can  now  obtain  a  compression  scheme  on  T{i)  by  merging  the  GPC's 
of  types  I  and  II;  namely,  we  proceed  through  the  leaves  of  T{i)  in  their 
induced  postorder;  for  each  leaf  which  is  a  copy  of  Z^,  .  .  .  ,/„_i  (resp.  of  /^), 
we  take  the  corresponding  GPC  of  type  I  (resp.  type  II)  starting  at  that  leaf. 

The  sequence  F  of  GPC's  obtained  in  this  way  is  the  required 
compression  scheme  on  T{i).  It  is  clearly  postordered,  and  its  executability 
can  be  established  by  proceeding  inductively  through  the  sequence,  and  using 
the  following  arguments: 

(a)  All  the  postorders  considered  here  (on  T{i),  T*{iQ),  and  T*  *{({))  are 
consistent  with  one  another,  since  they  are  all  induced  by  the  original 
postorder  on  T.  This  implies  the  separate  executability  of  the  sequence  of 
GPC's  of  type  II,  and  also  of  these  of  type  I  (in  each  copy  of  r(/o) 
separately,  and  also  all  together). 

(b)  Let  /=(/',...  ,z)  be  a  GPC  of  type  I,  where  /'  is  a  non-last  leaf,  and 
let  r'  be  the  father  of  /'  (i.e.  r'  is  the  root  of  the  copy  of  T  containing  /').  All 
the  nodes  of/  arc  included  in  the  same  copy  of  r(/o);  therefore,  if  g  is  any 
GPC  of  type  U  which  contains  a  node  on  the  path  from  /'  to  z  (excluding  z), 
then  this  node  must  be  r' ,  and  g  is  thus  executed  only  after  f  (since  a{g)  is 
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the  last  leaf  of  r').  By  (a),  the  other  GPC's  of  type  I  also  do  not  affect  the 
executability  of  /. 

(c)  Let  /  =  (/'  ,r'  ,w,  .  .  .  ,z)  be  a  GPC  of  type  II,  where  /'  is  a  last  leaf  and  r' 
is  its  father;  let  q'  be  the  root  of  the  copy  of  r(/o)  which  contains  /'  and  r' , 
then  q'  is  a  (possibly  improper)  descendant  of  u.  The  GPC /is  executable  at 
its  turn,  because:  (i)  /'  belongs  to  no  other  GPC,  thus  it  still  is  a  son  of  r' . 
(ii)  For  any  GPC  g  of  type  I,  if  it  contains  a  node  on  the  path  from  /'  to  z, 
then  all  its  nodes  lie  in  the  copy  of  r(/o)  with  root  q' ,  thus  g  does  not 
disconnect  r'  from  q'  and  u,  and  of  course  does  not  affect  the  other  nodes  of 
/.  (iii)  Finally,  the  other  GPC's  of  type  II  do  not  affect  the  executability  of  f 
(by  (a)). 

This  establishes  the  assertion  of  the  theorem  for  k  and  m  too,  and  thus 
completes  the  proof.  Q.E.D. 

Corollary  6.2:  For  all  n,  m  >  1 


y\>{n,m)  >  -^^m  , 


where 


3  =  3(n,m)  =  max  [k  >  \  \  A^{ 


2m 


n 


+  3)  <  2n]  , 


and  3  =  1  if  there  is  no  such  k. 
Proof:  Since  i|;(n,m)  >  m  always,  we  can  assume  p  >  1.  Consider  first  the 

2m 


case  2m  >  n.  Let  s  = 
obtain  for  all  i  >  B^{s) 


n 


1,  and  apply  Theorem  6.1  to  a  (l,5)-trce,  to 


4/(2 


/  +  i 


1,  s-2')  >  pj2' 


Put  b  =  ^AJs+3)  <  /I   and  i  =  \ogb  -  1;  then  2b  >  2^^^'^^^  by  (B6), 


2-e 

hence  /  ^  Bp(s)  and  we  have 


Now  let  p  = 


Hb-l,f) 


1,  then  pb  <  n  and 


sb 


psb         ris_ 


m.  Using  the  same 


argument  as  in  the  proof  of  Proposition  5.5  we  obtain 


A,{n,m)  >  Mpb,^)  >  pHb,^)  >  p^b-1,^) 


since 


f  and 
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n 


•n  ^  m. 


Note  that  the  compression  scheme  we  have  obtained  has  all  its  GPC's  of 
length  p.  Thus,  for  any  m'  <  m,  we  can  remove  m  —  m'  leaves  together 
with  their  associated  GPC's,  obtaining  a  compression  scheme  consisting  of  m' 
GPC's  of  length  3  each,  on  the  modified  tree.  Thus 

il;(/i,m')>  -^P(«.m)m' 

for  all  m'  <  m.  In  particular,  for  m'  <  m  =  y  ,  we  obtain 

^Kn,m')  >  j3(n,y)m'  ; 

but  by  definition  of  p,  in  this  case  we  have  p(/i,m')  =  ^{n—),  and  this 

completes  the  proof.   Q.E.D. 

Proof  of  Theorem  C:  Apply  Corollary  6.2  to  obtain 

where 

^  =  max  {k>  I:  A^{A)  <  An]  . 
If  it  =  a(n)  >  4,  then 

AM^Akik)  =  A{k)^n^An, 
thus  3  >  a(n),  and 

v|;(2n,n)>  jncx(n).  (C) 

If  a{n)  <  3,  then,  since  we  trivially  always  have  vji(n,m)  >  m,  (Q  holds  in 
this  case  too.   Q.E.D. 

Remark:  Equation  (C)  together  with  Theorem  A(ii)  imply 
\-i{n)  >  y]f{2n,n)  —  3n  >  — «a(/i)  —  3n  . 

Remark:  The  constructions  in  Theorem  6.1  and  Proposition  4.4  can  be 
combined  to  provide  an  explicit  construction  of  DS{n,?>)  sequences  of 
nonlinear  size.  Thus  the  tree  duplication  operation  T{i)  corresponds  to  a 
similar  sequence  duplication  operation,  and  the  inductive  construction  of 
compression  schemes  on  the  trees  T{i)  can  be  translated  to  an  inductive 
construction  of  DS{n,3)  sequences.  However,  it  seems  that  this  explicit 
construction  becomes  much  less  intuitive  than  the  implicit  one  through  GPC's 
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on  trees. 

7.   Concluding  Remarks 

7.1.  Applications 

Besides  the  numerous  applications  noted  in  [At],  whose  complexity 
bounds  can  now  be  given  more  concrete  forms  (using  our  estimate  for  s  =  3, 
or  Szemeredi's  upper  bound  for  s  >  3),  we  note  here  some  additional 
applications. 

(1)  Pointwise  minima/maxima  of  line  segments.  Let  Z^,  .  .  .  ,/„  be  n  line 
segments  in  the  plane,  none  of  which  is  vertical.  For  each  real  xq  let  J{xq)  be 
the  line  segment  /jt  whose  intersection  with  the  vertical  line  j:  =  xq  is  lowest. 
Then  the  smallest  number  m  of  intervals  on  the  j:-axis,  over  each  of  which  / 
is  constant,  is  0(na{n)). 

Proof:  Let  AT  be  a  real  number  greater  than  the  absolute  value  of  the  slope  of 
each  of  the  line  segments  4.  Extend  each  line  segment  /^  to  a  continuous 
piecewise  linear  function  over  the  whole  j:-axis,  by  making  it  linear  with 
slope  -i-AT  on  the  right  of  4  and  linear  with  slope  —AT  on  its  left.  It  is  easily 
checked  that  each  pair  of  these  extended  functions  intersect  in  at  most  3 
points,  so  that  the  required  number  m  of  ;c-intervals  is  at  most  X^^n),  thus 
0(«a(n)). 

Remark:  It  is  an  open  problem  whether  k^^n)  can  actually  be  obtained  in  this 
set-up. 

(2)  Pointwise  minima/maxima  of  piecewise  linear  functions.  Let  /i,  .  .  .  ,/; 
be  /  continuous  piecewise  linear  functions  defined  over  a  common  interval  /, 
and  let  n  be  the  total  number  of  line  segments  constituting  the  graphs  of 
f\,  .  .  .  ,fi.  Then  the  maximal  number  of  line  segments  constituting  the  graph 
of  the  lower  envelope  of  these  functions  is  0{na.{n)),  and  this  envelope  can 
be  computed  in  time  0{na.{n)  log  n). 

Proof:  Take  each  of  the  n  line  segments  appearing  in  the  graphs  of  the/,'s, 
and  extend  it  to  a  continuous  piecewise  linear  function  as  in  (1)  above.  It  is 
plain  that  the  lower  envelope  of  the  /,'s  is  identical  to  the  lower  envelope  of 
these  extended  functions,  from  which  the  first  part  of  our  assertion  follows. 
Calculation  of  this  lower  envelope  within  the  stated  time  bound  can  be 
accomplished  using  a  straightforward  "divide  and  conquer"  algorithm,  which 
partitions  the  collection  of  the  n  extended  functions  into  two  equal  subsets, 
computes  recursively  the  lower  envelope  of  the  functions  in  each  of  these 
subsets,  and  finally  merges  the  two  lower  envelopes  into  the  lower  envelope 
of  all  the  n  functions,  in  time  that  is  linear  in  the  number  of  line  segments 
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forming  these  two  envelopes. 

(3)  Dynamic  sorting.  Consider  the  following  "dynamic"  sorting  problem,  in 
which  we  want  to  sort  a  sequence  5  of  n  numbers  by  repeated  swaps  of 
vmsorted  adjacent  pairs  of  elements,  but  with  the  added  difficulty  that 
eiemenis  are  being  added  and  removed  from  5  during  our  sorting  process. 
Mo;e  precisely,  suppose  that  the  elements  to  be  sorted  are  the  integers 
I,  .  .  .  ,n.  Tnitialiy  S  is  empty.  The  sorting  process  consists  of  a  sequence  of 
operations  on  S,  where  each  operation  is  one  of  the  following: 

(i)  Insert  a  new  eiement  as  the  first  element  of  S  (each  integer  is  inserted  into 

S  just  once). 

(ii)  Delete  the  first  element  of  S. 

(iii)  Swap  anj'  two  adjacent  elements  s,,  St+i  of  S  for  which  Si  >  Si  +  i- 

The  prob-em  at  hand  is  to  estimate  the  maximal  number  C(n)  of  changes 
in  thcyir.s/  place  of  S.  Qearly,  if  no  insertions  or  deletions  from  S  are  applied 
(or  rather,  if  we  first  insert  all  n  elements  into  S,  then  swap  them,  and  finally 
delete  them  all)  then  plainly  C{n)  =  0(n).  In  the  dynamic  case  however  we 
have 

Proposition:  C(;i)  ~  X2(n};thus  C(n)  =  &(na(n)). 

Proof:  Let  U  be  tne  sequence  of  elements  as  they  appear  in  the  first  place  of 
5.  It  is  easy  to  see  that  U  is  a.  DS(n,3)  sequence:  Plainly  no  two  adjacent 
elements  of  U  are  equal.  Suppose  that  there  exist  five  indices 
i  <  j  <  k  <  I  <  m  such  that  Ui  =  u^  =  u„  =  a,  Uj  =  Ui  =  b  and  a  i^  b. 
Then  the  (middle)  appearance  of  a  as  m^  must  have  been  preceded  by  a  swap 
of  a  and  b,  and  must  later  be  followed  by  a  similar  swap.  This  however  is 
impossible,  because  two  elements  can  be  swapped  at  most  once.  Therefore 
C(n)  <  X3(n). 

For  the  converse  statement,  let  U  =  {ui,  .  .  .  ,u„)  be  any  D5(n,3) 
sequence.  We  obtain  from  U  a  dynamic  sorting  process  (with  U  its 
corresponding  sequence  of  first  elements),  as  follows.  As  in  Section  4.1, 
assume  without  loss  of  generality  that  for  any  two  symbols  a  and  b  appearing 
in  LJ,  we  have  a  <  b  \i  and  only  if  the  index  ^.^  of  the  first  occurrence  of  a  in 
U  is  less  than  the  index  \i.^  of  the  first  occurrence  of  b.  We  start  with  an 
empty  sequence  S,  and  for  each  /:  =  1,  .  .  .  ,m  we  do  one  of  the  following 
(sequences  of)  operations  on  5,  where  a  —  U),: 

(i)  If  Uj^  is  the  first  occurrence  of  a  in  V ,  we  insert  a  as  the  first  element  of  S. 

(ii)  Otherwise,  a  presently  appears  somewhere  in  S  (but  not  as  its  first 
element).  We  then  move  a  to  the  first  place  of  S  by  repeated  swapping  of  a 
with  all  elements  lying  in  S  before  it. 
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(iii)  If  Mt  is  the  last  occurrence  of  a  in  U,  we  then  (after  (ii))  delete  a  from 
the  first  place  of  S. 

We  claim  that  we  can  always  perform  the  swaps  in  (ii).  For  suppose  the 
contrary,  and  let  k  be  the  first  place  in  U  where  one  of  the  corresponding 
swaps  in  (ii)  cannot  be  executed.  Let  a  =  u^,  and  let  Z?  <  a  be  an  element 
appearing  in  S  before  a  (so  that  the  required  swap  of  a  with  b  cannot  be 
done).  Since  b  <  a  wc  have  y^i,  <  yi^,  and  moreover  ^i^  <  k  <  v,,,  where  v^ 
is  the  index  of  the  last  occurrence  of  b  in  U  (since  b  has  not  yet  been 
deleted).  We  claim  that  b  must  have  appeared  in  U  also  at  some  place  / 
between  p.^  and  it.  Indeed,  just  after  ji^,  b  lies  in ,5  after  a,  whereas  just 
before  k  they  have  changed  their  order  in  5.  Since  b  <  a  that  must  have 
occured  by  swapping  b  towards  the  first  place  in  S,  and  by  definition  that 
could  have  happened  only  due  to  an  intermediate  appearance  of  b  in  U.  This 
however  implies  that  U  contains  the  illegal  subsequence  b  a  b  a  b  at  the  five 
indices  fi^  <  ^.^  <  /  <  it  <  v^,  contrary  to  assimiption.  This  shows  that 
\i(n)  <  C{n),  and  hence  these  two  functions  are  equal.  Q.E.D.  , 

7.2.  Open  Problems 

(1)  For  s  >  3,  we  do  not  know  if  there  is  any  simple  connection  between 
DS(n,s)  sequences  and  tree  operations  like  GPC's.  An  obvious  open  problem 
is  whether  the  upper  bound  of  [Sz]  on  Xj(n)  can  be  improved  for  .s  >  3.  This 
problem  was  settled  recently,  after  the  original  submission  of  this  paper,  in 
[Shi],  where  Szemeredi's  bound  is  improved  for  s>3  to  boands  which  are 
roughly  of  the  form 

\Xn)  =  0(/ia(n)^(°(")'"'))  . 

However,  no  lower  bounds,  better  than  the  Cl(na{n))  bound  obtained  in  this 
paper,  are  known  as  yet  for  s>3. 

(2)  We  have  already  raised  the  question  whether  X3(n)  can  be  attained  for  the 
pointwise  minimum  of  n  line  segments.  A  related  open  problem  is  whether 
X3(n)  can  be  attained  for  the  minimum  of  n  cubic  polynomials  (it  is  easy  to 
check  that  both  Xi(n)  and  X2(n)  are  attained  for  the  minimum  of  n  linear  and 
quadratic  functions,  respectively). 

(3)  We  have  obtained  the  bounds  on  X3(n)  by  transforming  DS(n,3) 
sequences  into  compression  schemes  of  GPC's.  However,  generalized  path 
compressions  on  trees  may  be  interesting  to  study  for  their  own  sake. 
Various  open  problems  arise  in  connection  with  GPC's,  such  as  that  of 
finding  other  restricted  classes  of  sequences  of  GPC's  for  which  linear  or 
almost  linear  upper  bounds  can  be  established.  For  example,  suppose  wc 
allow  only  standard  path  compressions  (i.e.  path  compressions  whose  nodes 
arc  adjacent  to  one  another  along  their  current  path),  and  still  require  them 
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to  be  executed  in  postorder.  Is  the  maximal  total  length  of  such  a  sequence 
linear  in  the  number  of  nodes  of  the  corresponding  tree? 

(4)  There  is  a  similarity  between  the  two  problems  studied  here  and  in  [Ta], 
in  that  they  both  involve  path  compressions  on  trees  and  they  both  attain 
similaj'  upper  and  U»v/er  bounds.  Is  there  some  general  problem  of  this  kind, 
of  which  both  our  problem  and  that  of  [Ta]  are  special  instances? 

(5)  Consider  the  following  two-dimensional  generalization  of  our  problem: 
I^t  fxix,y).  .  .  .  .fjx,y)  be  a  collection  of  n  continuous  real  valued  functions 
defined  over  some  two-dimensional  rectangular  region  R.  Suppose  that  for 
each  triple  of  distinct  indices  i,j,  k  the  surfaces  of  z  =  fi{x,y),  z  =  fjix,y), 
z  =  fkix,y)  intersect  in  at  most  s  points.  What  is  the  maximal  number  of 
connected  portions  of  the  surfaces  of  the  /,'s  which  appear  in  their  lower 
envelope?  More  precisely,  since  we  are  interested  in  the  overall  complexity  of 
the  lov/er  envelope,  what  is  the  maximal  number  \^^\n)  of  "corners"  of  this 
lower  envelope  (i.e.  points  where  the  minimum  of  the  functions//  is  attained 
simultanecvjsly  by  at  least  three  of  these  functions)?   An  obvious  estimate  is 

Ap)(n)  i?  ~n\    On  the  otlier  hand,  >^P(n)  =  Cl{n^).    Indeed,  such  a  lower 
6 

bound  is  obtained  for  the  following  collection  of  2n  functions 

Mx,y)  =  ix-  -i-)^ 

g,{x,y)  =  2(y-j^y 

for  /  =  1.     .      n.  We  leave  details  to  the  reader. 

Some  iccent  progress  on  this  generalized  2-D  problem  was  made  by 
Livne  and  Sharir  [LS]  (after  the  original  submission  of  this  paper),  who  have 
shown  that  X^^^n)  =  ©(n^),  and  that  \i^\n)  =  Cl{n^a{n)).  Another  recent 
result  [Sh2]  yields  an  upper  bound  of  the  form  0{n\j{n))  on  the  number  of 
corners  on  the  minimum  of  n  bivariate  continuous  functions,  under  some 
additional  assumptions  concerning  these  functions,  the  most  constraining  of 
which  is  that,  in  each  cross  section  x  =  const,  each  pair  of  the  graphs  of  these 
functions  intersect  in  at  most  two  points. 

An  interesting  special  case  of  this  two-dimensional  problem  is  where  the 
functions  in  question  fi{x,y)  are  all  polynomials  of  some  maximal  degree  d 
(so  that  in  particular,  assuming  non-degeneracy,  each  triple  of  them  intersect 
in  at  most  s  =^  d-^  points).  In  this  sf)ecial  case,  every  cross-section  of  the 
graphs  of  these  polynomials  at  x  =  const  or  y  =  const  yields  a  sequence  of 
univariate  functions  each  pair  of  which  intersecting  in  at  most  d~  points,  so 
that  the  lower  envelope  of  every  such  cross  section  behaves  as  a  standard 
Davenport-Schinzel  sequence. 
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7.3.   On  the  role  of  Ackermann's  functions  in  combinatorial  problems 

Although  Ackermann's  function  A  is  very  fast-growing,  one  can  obtain 
functions  which  grow  much  faster  than  A  by  extending  the  recursive 
definitions  from  Section  3  as  follows.  For  each  countable  ordinal  7  we  define 
a  generalized  Ackermann's  function  A^  from  the  set  N  of  positive  integers  into 
itself  in  the  following  transfinite  inductive  manner: 

(1)  Ai(n)  =  2/1  ,      n  €  N  . 

(2)  If  7  is  not  a  limit  ordinal,  say  -y  =  3  +  1,  then  A/^n)  =  AI"\1),    n  ^  N. 

(3)  If  7  is  a  limit  ordinal,  we  associate  with  y  some  fixed  increasing 
sequence  {y{n)}  of  ordinals  less  than  7  whose  limit  is  7.  Then 
A^[n)  =  A^(„)(n),     n  ^N. 

Thus  A  =  A^  (if  we  put  oi{n)  =  n,  for  n  ^  N). 

The  rate  of  growth  of  the  functions  A^  increases  with  7.  For  example, 
A„  is  not  primitive  recursive,  and  in  fact  grows  faster  than  any  primitive 
recursive  function  (see  [Ro]).  The  function  A^^,  where  to  is  the  first 
inaccessible  ordinal  (it  is  the  limit  of  the  sequence  co,  co"-,  co*^",  ,..).  has  the 
following  remarkable  property  (cf.  [Kr]):  Consider  statements  Q  of  the  form 
Vh  ^  P(n,m),  where  P(n,m)  is  a  provably  recursive  first-order  statement  in 
Peano  arithmetic  with  two  free  variables  m,  n  (i.e.  there  is  an  algorithm  for 
deciding  if  P{n,m)  is  true  for  given  n,  m,  and  a  proof  -  in  Peano  arithmetic  - 
that  the  algorithm  always  terminates).  Suppose  that  2  is  a  true  statement, 
and  associate  with  it  a  function  ^q  iN-N,  defined  as  follows:  for  each 
n  €  N,  ^Q{n)  is  the  smallest  m  satisfying  P(n,m).  Then  Q  is  provable  from 
the  Peano  axioms  (i.e.  is  a  theorem  of  first  order  formal  number  theoiy)  if 
and  only  if  there  exists  an  ordinal  7  <  €9  such  that  c^cC")  *^  ^y{")  for  all 
sufficiently  large  n.  A  recent  result  of  Paris  and  Harrington  [PH]  yields  a 
variant  of  Ramsey's  Theorem  which  can  be  expressed  in  the  above  form  Q, 
but  for  which  the  associated  function  4>q  grows  as  fast  as  A^^,  so  that  Q  is  not 
provable  from  the  Peano's  axioms  (see  [KS]  for  details). 

As.  a  matter  of  fact,  even  for  the  first  infinite  ordinal  to,  there  exist  very 
few  "natural"  statements  Q  of  the  form  vn  ^m  P{n,m)  whose  associated 
function  ^q  grows  like  A^  or  faster.  Our  results  on  Davenport-Schinzel 
sequences  may  well  be  one  of  the  most  natural  statements  yet  known  that  can 
be  expressed  in  this  form.  Indeed,  let 

Q    ^    yk^m    \2>{m)  >  km  . 

Our  lower  bound  on  X3  implies  that  Q  is  a  true  statement;  moreover,  both 
lower  and  upper  bounds  imply  that  <^q  grows  as  A^  (note  that  for  fixc-d  k  and 
m    the  statement   X^i^)  —  '^'^    '^   a  provably   recursive  statement  in  Peano 
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arithmetic:  simply  enumerate  all  sequences  of  length  km  composed  of  m 
symbols,  and  check  each  of  them  for  being  a DS{m,3)  sequence). 

Tarjan's  bounds  [Ta]  on  the  complexity  of  the  union-find  algorithm  can 
also  be  translated  into  such  a  statement  Q  with  <})q  growing  as  fast  as  A^. 
Another  example  of  this  sort  arises  in  the  proof  of  Van  der  Waerden's 
theorem  on  the  existence  of  arbitrarily  long  monochromatic  arithmetic 
progressions  in  any  finite  coloring  of  the  integers  (cf.  [GRS]);  however,  in 
this  case  A^  only  serves  as  an  upper  bound  for  the  associated  function  <^q  , 
and  its  actual  rate  of  growth  is  not  known. 

It  is  also  interesting  to  note  that  our  inductive  solution  of  the  recurrence 
equation  (*)  for  vl;  as  given  in  Section  5  can  be  extended  in  a  transfinite 
inductive  manner  to  obtain  upper  bounds  for  \\i  which  involve  the  inverse 
functions  a.^  of  A^  for  ordinals  7  >  w.  Indeed,  recall  that  formula  (B)  there 
yields 

\\i{n,m)  <  Cina^(n)  +  C2ma^{n) 

for  some  constants  Cu  C5  >  0,  Now  choose  b  =  — tt"  in  (*)  (we  finesse 

^      ^  a„(") 

here  details  which  handle  the  case  in  which  a.^{n)  does  not  divide  n;  the 
methods  used  in  Section  5  can  be  applied  here  too) .  Then 

^\>{b-l,m*)  <  CM^oib)  +  C2m*a^ib)  <  C^n  +  Cjm^a^Cn)  , 

so  that 

vl>(/j,m)  IS  (Ci  +  2)n  +  (C2a„(/i)  +  2)m*  +  ^  Ho^^in),m,)  , 

from  which  we  readily  obtain 

il;(«,m)  <  (Ci  +  2)na,  +  i(/i)  +  (C20c^{n)  +  2)m  . 
This  process  can  be  carried  still  further  to  yield,  for  each  k  >  1, 

il;(n.m)  <  (Ci+2A:)na^  +  t(/i)  +  {C20L^{n)  +  2k)m  , 
and  by  appropriate  choice  of  k  we  can  also  obtain 

y\iin,m)  <  D^na2^{n)  +  (C2a^(n)  -t-  D2a2^(n))m  . 

We  can  further  extend  this  transfinite  process  to  obtain  upper  bounds  of  this 
sort  involving  still  larger  ordinals.  However,  for  general  values  of  n  and  m, 
these  new  bounds  do  not  improve  the  bounds  obtained  in  Section  5,  because 
the  coefficients  of  m  keep  increasing.  Nevertheless,  if  one  assumes,  say, 

/m2>0. 
m  =  0{ -^— ) 

au,(") 
then  the  above  bound  gives 
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i|i(n,m)  =  0(na2^in))  , 

which  obviously  improves  upon  0{noL^(n)).    Further  restrictions  on  m  will 
yield  still  better  upper  bounds  for  \\i. 

References 

[Ac]  W.  Ackermann,  Zum  Hilbertschen  Aufbau  der  reellen  Zahlen,  Math. 
Ann.  99(1928),  pp.  118-133. 

[At]  M.  Atallah,  Dynamic  Computational  Geometry,  Proc.  24th  Symp.  on 
Foundations  of  Computer  Science,  1983,  pp.  92-99. 

[DS]  H.  Davenport  and  A.  Schinzel,  A  Combinatorial  Problem  Connected 
with  Differential  Equations,  Amer.  J.  Math.  87(1965)  pp.  684-694. 

[Da]  H.  Davenport,  A  Combinatorial  Problem  Connected  with  Differential 
Equations,  U,  Acta  Arithmetica  17(1971)  pp.  363-372. 

[Fi]  M.J.  Fisher,  Efficiency  of  Equivalence  Algorithms,  in  Complexity  of 
Computer  Computations,  R.E.  Miller  ar>d  J.W.  'Hiatchei,  Eds., 
Plenum  Press,  New  York  1972,  pp.  153-168. 

[GRS]  R.L.  Graham,  B.L.  Rothschild  and  J.H.  Spencer,  Ramsey  Theory, 
Wiley-Interscience,  New  York  1980. 

[Kr]  G.  Kreisel,  On  the  Interpretation  of  Nonfinitistic  Proofs,  II,  J. 
Symbolic  Logic  17(1952),  pp.43-58. 

[KS]  J.  Ketonen  and  R.M.  Solovay,  Rapidly  Growing  Ramsey  Functions, 
Ann.  of  Math.    113(1981),  pp.  267-314. 

[LS]  R.  Livne  and  M.  Sharir,  On  Maxima  of  Functions,  Intersection 
Patterns  of  Curves,  and  Davenport-Schinzel  Sequences,  Proc.  26th 
IEEE  Symposium  on  Foundations  of  Computer  Science,  Portland,  Ore., 
October  1985. 

[PH]  J.  Paris  and  L.  Harrington,  A  Mathematical  Incompleteness  in  Peano 
Arithmetic,  in  Handbook  of  Mathematical  Logic,  J.  Barwise,  Ed., 
North-Holland  1977,  pp.  1133-1142. 

[Ro]  H.  Rogers,  Theory  of  Recursive  Functions  and  Effective  Computability, 
McGraw-Hill,  1967. 

[RS]  D.P.  Roselle  and  R.G.  Stanton,  Some  Properties  of  Davenport- 
Schinzel  Sequences,  Acta  Arithmetica  17(1971),  pp.  355-362. 

[Shi]  M.  Sharir,  Almost  Linear  Upper  Bounds  on  the  Length  of  General 
Davenport-Schinzel  Sequences,  Tech.  Rcpt  29/85,  The  Eskenasy 
Institute  of  Computer  Sciences,  Tel  Aviv  University,  February  85. 


-32- 

[Sh2]     M.  Sharif,  On  the  Two-dimensional  Davenport-Schinzel  Problem,  in 
preparation. 

[Sz]       E.    Szemeredi,    On   a   Problem   by   Davenport   and   Schinzel,  Acta 
Arithmetica  25(1974)  pp.  213-224. 

[Ta]       R.E.    Tarjan,    Efficiency    of    a    Good    but    not    Linear    Set-union 
Algorithm,  J.  Assoc.  Computing  Machinery  22(1975),  pp.  215-225. 


Fig.  2.1.  A  generalized  path  compression 
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Fig.  2.2.  A  postordered  tree 


Fig.  4.1.  Initial  arrangement  of  the  nodes  p,  -y,  ...  in  T.   A  solid  edge  denotes 
a  nonempty  path,  whereas  a  dashed  edge  denotes  a  possibly  empty  path. 


Fig.  6.1.  r(/+l) 
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